# Real-time Speech Transcription
Whisper Large V3 Turbo
MIT
Whisper is OpenAI's state-of-the-art automatic speech recognition (ASR) and speech translation model, trained on over 5 million hours of labeled data with strong zero-shot generalization capabilities. The Turbo version is a pruned and fine-tuned variant of the original, reducing decoder layers from 32 to 4, significantly improving speed with a slight quality trade-off.
Speech Recognition
Transformers Supports Multiple Languages

W
unsloth
94
1
Whisper Large V3
Apache-2.0
Whisper is OpenAI's state-of-the-art automatic speech recognition (ASR) and speech translation model, supporting multiple languages
Speech Recognition
Safetensors Supports Multiple Languages
W
unsloth
4,002
1
Gigaam Ctc
MIT
GigaAM-v2-CTC is a Russian automatic speech recognition (ASR) model trained using the CTC loss function and can be utilized via the Hugging Face transformers library.
Speech Recognition
Transformers Other

G
waveletdeboshir
255
1
Whisper Large V3 Atco2 Asr
Apache-2.0
A speech recognition model fine-tuned based on OpenAI Whisper-large-v3, specializing in Air Traffic Control (ATCO) scenarios with a word error rate of 17.04%
Speech Recognition
Transformers

W
jlvdoorn
1,792
5
Whisper Kannada Tiny
Apache-2.0
A Kannada automatic speech recognition model fine-tuned based on openai/whisper-tiny, trained on multiple public Kannada ASR corpora
Speech Recognition Other
W
vasista22
119
6
Whisper Tiny
Apache-2.0
Whisper Tiny is an automatic speech recognition (ASR) model developed by OpenAI, the smallest version in the Whisper series with 39M parameters.
Speech Recognition Supports Multiple Languages
W
openai
328.82k
318
Wav2vec2 Large Xls R 300m Singlish Colab
Apache-2.0
A speech recognition model fine-tuned on the Singapore English (li_singlish) dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

W
RuiqianLi
22
1
Wav2vec2 Xls R 2b 22 To 16
Apache-2.0
Facebook's Wav2Vec2 XLS-R model fine-tuned for multilingual speech translation tasks, supporting mutual translation between 22 input languages and 16 output languages.
Speech Recognition
Transformers Supports Multiple Languages

W
facebook
38
14
Featured Recommended AI Models